Search CORE

21 research outputs found

Pathway and network analysis of more than 2500 whole cancer genomes.

The catalog of cancer driver mutations in protein-coding genes has greatly expanded in the past decade. However, non-coding cancer driver mutations are less well-characterized and only a handful of recurrent non-coding mutations, most notably TERT promoter mutations, have been reported. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2658 cancer across 38 tumor types, we perform multi-faceted pathway and network analyses of non-coding mutations across 2583 whole cancer genomes from 27 tumor types compiled by the ICGC/TCGA PCAWG project that was motivated by the success of pathway and network analyses in prioritizing rare mutations in protein-coding genes. While few non-coding genomic elements are recurrently mutated in this cohort, we identify 93 genes harboring non-coding mutations that cluster into several modules of interacting proteins. Among these are promoter mutations associated with reduced mRNA expression in TP53, TLE4, and TCF4. We find that biological processes had variable proportions of coding and non-coding mutations, with chromatin remodeling and proliferation pathways altered primarily by coding mutations, while developmental pathways, including Wnt and Notch, altered by both coding and non-coding mutations. RNA splicing is primarily altered by non-coding mutations in this cohort, and samples containing non-coding mutations in well-known RNA splicing factors exhibit similar gene expression signatures as samples with coding mutations in these genes. These analyses contribute a new repertoire of possible cancer genes and mechanisms that are altered by non-coding mutations and offer insights into additional cancer vulnerabilities that can be investigated for potential therapeutic treatments

Repository for Publications and Research Data

DSpace@MIT

Lund University Publications

Ghent University Academic Bibliography

Publikationer från Uppsala Universitet

eScholarship - University of California

Digitala Vetenskapliga Arkivet - Academic Archive On-line

UPF Digital Repository

Apollo (Cambridge)

Bern Open Repository and Information System (BORIS)

Online Research Database In Technology

University of Queensland eSpace

Analyses of non-coding somatic drivers in 2,658 cancer whole genomes.

Author: Abascal Federico
Akdemir Kadir C.
Alvarez Eva G.
Amin Samirkumar B.
Bader Gary D.
Baez-Ortega Adrian
Bandopadhayay Pratiti
Barenboim Jonathan
Beroukhim Rameen
Bertl Johanna
Boroevich Keith A.
Boutros Paul C.
Bowtell David D. L.
Brors Benedikt
Brunak Soren
Burns Kathleen H.
Busanovich John
Campbell Peter J.
Carlevaro-Fita Joana
Chakravarty Dimple
Chan Calvin Wing Yiu
Chan Kin
Chen Ken
Choi Jung Kyoon
CortesCiriano Isidro
Craft David
Deu-Pons Jordi
Dhingra Priyanka
Diamanti Klev
Dueso-Barroso Ana
Dunford Andrew J.
Edwards Paul A.
Estivill Xavier
Etemadmoghadam Dariush
Feuerbach Lars
Fink J. Lynn
Fonseca Nuno A.
Frenkel-Morgenstern Milana
Frigola Joan
Gambacorti-Passerini Carlo
Garsed Dale W.
Gerstein Mark
Getz Gad
Gonzalez-Perez Abel
Gordenin Dmitry A.
Guo Qianyun
Gut Ivo G.
Haan David
Haber James E.
Hamilton Mark P.
Haradhvala Nicholas J.
Harmanci Arif O.
Helmy Mohamed
Herrmann Carl
Hess Julian M.
Hobolth Asger
Hodzic Ermin
Hong Chen
Hornshoj Henrik
Hutter Barbara
Imielinski Marcin
Isaev Keren
Izarzugaza Jose M. G.
Johnson Rory
Johnson Todd A.
Jones David T. W.
Ju Young Seok
Juul Malene
Juul Randi Istrup
Kahles Andre
Kahraman Abdullah
Kazanov Marat D.
Kellis Manolis
Khurana Ekta
Kim Jaegil
Kim Jong K.
Kim Youngwook
Klimczak Leszek J.
Koh Youngil
Komorowski Jan
Korbel Jan O.
Kumar Kiran
Kumar Sushant
Lanzos Andres
Larsson Erik
Lawrence Michael S.
Lee Donghoon
Lee Eunjung Alice
Lee Jake June-Koo
Lehmann Kjong-Van
Li Shantao
Li Xiaotong
Li Yilong
Lin Ziao
Liu Eric Minwei
Lochovsky Lucas
Lopez-Bigas Nuria
Lou Shaoke
Lynch Andy G.
Macintyre Geoff
Madsen Tobias
Marchal Kathleen
Markowetz Florian
Martincorena Inigo
Martinez-Fundichely Alexander
Maruvka Yosef E.
McGillivray Patrick D.
Meyerson Matthew
Meyerson William
Miyano Satoru
Muinos Ferran
Mularoni Loris
Nakagawa Hidewaki
Navarro Fabio C. P.
Nielsen Morten Muhlig
Ossowski Stephan
Paczkowska Marta
Park Keunchil
Park Kiejung
Park Peter J.
Pearson John, V
Pedersen Jakob Skou
Pich Oriol
Pons Tirso
Puiggros Montserrat
Pulido-Tamayo Sergio
Raphael Benjamin J.
Reimand Juri
Reyes-Salazar Iker
Reyna Matthew A.
Rheinbay Esther
Rippe Karsten
Roberts Nicola D.
Roberts Steven A.
RodriguezMartin Bernardo
Rubin Mark A.
Rubio-Perez Carlota
Sabarinathan Radhakrishnan
Sahinalp S. Cenk
Saksena Gordon
Salichos Leonidas
Sander Chris
Schumacher Steven E.
Scully Ralph
Shackleton Mark
Shapira Ofer
Shen Ciyue
Shrestha Raunak
Shuai Shimin
Sidiropoulos Nikos
Sieverling Lina
Sinnott-Armstrong Nasa
Stein Lincoln D.
Stewart Chip
Stuart Joshua M.
Tamborero David
Tiao Grace
Torrents David
Tsunoda Tatsuhiko
Tubio Jose M. C.
Umer Husen Muhammad
Uuskula-Reimand Liis
Valencia Alfonso
Vazquez Miguel
Verbeke Lieven P. C.
Villasante Izar
von Mering Christian
Waddell Nicola
Wadelius Claes
Wadi Lina
Wala Jeremiah A.
Wang Jiayin
Warrell Jonathan
Waszak Sebastian M.
Weischenfeldt Joachim
Wheeler David A.
Wu Guanming
Yang Lixing
Yao Xiaotong
Yoon Sung-Soo
Yu Jun
Zamora Jorge
Zhang Cheng-Zhong
Zhang Jing
Zhang Xuanping
Zhang Yan
Zhao Zhongming
Zou Lihua
Publication venue: Nature
Publication date: 01/01/2020
Field of study

The discovery of drivers of cancer has traditionally focused on protein-coding genes1-4. Here we present analyses of driver point mutations and structural variants in non-coding regions across 2,658 genomes from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium5 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). For point mutations, we developed a statistically rigorous strategy for combining significance levels from multiple methods of driver discovery that overcomes the limitations of individual methods. For structural variants, we present two methods of driver discovery, and identify regions that are significantly affected by recurrent breakpoints and recurrent somatic juxtapositions. Our analyses confirm previously reported drivers6,7, raise doubts about others and identify novel candidates, including point mutations in the 5' region of TP53, in the 3' untranslated regions of NFKBIZ and TOB1, focal deletions in BRD4 and rearrangements in the loci of AKR1C genes. We show that although point mutations and structural variants that drive cancer are less frequent in non-coding genes and regulatory sequences than in protein-coding genes, additional examples of these drivers will be found as more cancer genomes become available

Publikationsserver der Universität Tübingen

Digitala Vetenskapliga Arkivet - Academic Archive On-line

UPF Digital Repository

Repository for Publications and Research Data

DSpace@MIT

Lund University Publications

Ghent University Academic Bibliography

Publikationer från Uppsala Universitet

UCL Discovery

Copenhagen University Research Information System

eScholarship - University of California

Apollo (Cambridge)

Bern Open Repository and Information System (BORIS)

University of St. Andrews - Pure

St Andrews Research Repository

Cancer LncRNA Census reveals evidence for deep functional conservation of long noncoding RNAs in tumorigenesis.

Author: Abascal Federico
Amin Samirkumar B.
Bader Gary D.
Barenboim Jonathan
Beroukhim Rameen
Bertl Johanna
Boroevich Keith A.
Brunak Soren
Campbell Peter J.
Carlevaro-Fita Joana
Carlevaro-Fita Joana
Chakravarty Dimple
Chan Calvin Wing Yiu
Chen Ken
Choi Jung Kyoon
Deu-Pons Jordi
Dhingra Priyanka
Diamanti Klev
Feuerbach Lars
Feuerbach Lars
Fink J. Lynn
Fonseca Nuno A.
Frigola Joan
Gambacorti-Passerini Carlo
Garsed Dale W.
Gerstein Mark
Getz Gad
Gonzalez-Perez Abel
Guo Qianyun
Gut Ivo G.
Haan David
Hamilton Mark P.
Haradhvala Nicholas J.
Harmanci Arif O.
Helmy Mohamed
Herrmann Carl
Hess Julian M.
Hobolth Asger
Hodzic Ermin
Hong Chen
Hong Chen
Hornshoj Henrik
Isaev Keren
Izarzugaza Jose M. G.
Johnson Rory
Johnson Todd A.
Juul Malene
Juul Randi Istrup
Kahles Andre
Kahraman Abdullah
Kellis Manolis
Khurana Ekta
Kim Jaegil
Kim Jong K.
Kim Youngwook
Komorowski Jan
Korbel Jan O.
Kumar Sushant
Lanzos Andres
Lanzos Andres
Larsson Erik
Lawrence Michael S.
Lee Donghoon
Lehmann Kjong-Van
Li Shantao
Li Xiaotong
Lin Ziao
Liu Eric Minwei
Lochovsky Lucas
Lou Shaoke
Madsen Tobias
Marchal Kathleen
Martincorena Inigo
Martinez-Fundichely Alexander
Maruvka Yosef E.
Mas-Ponte David
McGillivray Patrick D.
Meyerson William
Muinos Ferran
Mularoni Loris
Nakagawa Hidewaki
Nielsen Morten Muhlig
Paczkowska Marta
Park Keunchil
Park Kiejung
Pedersen Jakob Skou
Pedersen Jakob Skou
Pich Oriol
Pons Tirso
Pulido-Tamayo Sergio
Raphael Benjamin J.
Reimand Juri
Reyes-Salazar Iker
Reyna Matthew A.
Rheinbay Esther
Rubin Mark A.
Rubio-Perez Carlota
Sabarinathan Radhakrishnan
Sahinalp S. Cenk
Saksena Gordon
Salichos Leonidas
Sander Chris
Schumacher Steven E.
Shackleton Mark
Shapira Ofer
Shen Ciyue
Shrestha Raunak
Shuai Shimin
Sidiropoulos Nikos
Sieverling Lina
Sinnott-Armstrong Nasa
Stein Lincoln D.
Stuart Joshua M.
Tamborero David
Tiao Grace
Tsunoda Tatsuhiko
Umer Husen M.
Uuskula-Reimand Liis
Valencia Alfonso
Vazquez Miguel
Verbeke Lieven P. C.
von Mering Christian
Wadelius Claes
Wadi Lina
Wang Jiayin
Warrell Jonathan
Waszak Sebastian M.
Weischenfeldt Joachim
Wheeler David A.
Wu Guanming
Yu Jun
Zhang Jing
Zhang Xuanping
Zhang Yan
Zhao Zhongming
Zou Lihua
Publication venue: Commun Biol
Publication date: 01/01/2020
Field of study

Long non-coding RNAs (lncRNAs) are a growing focus of cancer genomics studies, creating the need for a resource of lncRNAs with validated cancer roles. Furthermore, it remains debated whether mutated lncRNAs can drive tumorigenesis, and whether such functions could be conserved during evolution. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, we introduce the Cancer LncRNA Census (CLC), a compilation of 122 GENCODE lncRNAs with causal roles in cancer phenotypes. In contrast to existing databases, CLC requires strong functional or genetic evidence. CLC genes are enriched amongst driver genes predicted from somatic mutations, and display characteristic genomic features. Strikingly, CLC genes are enriched for driver mutations from unbiased, genome-wide transposon-mutagenesis screens in mice. We identified 10 tumour-causing mutations in orthologues of 8 lncRNAs, including LINC-PINT and NEAT1, but not MALAT1. Thus CLC represents a dataset of high-confidence cancer lncRNAs. Mutagenesis maps are a novel means for identifying deeply-conserved roles of lncRNAs in tumorigenesis

Repository for Publications and Research Data

DSpace@MIT

Lund University Publications

Publikationer från Uppsala Universitet

Ghent University Academic Bibliography

Digitala Vetenskapliga Arkivet - Academic Archive On-line

UPF Digital Repository

Apollo (Cambridge)

Bern Open Repository and Information System (BORIS)

Global network construction.

Author: Lieven P. C. Verbeke (776139)
Jimmy Van den Eynden (776140)
Ana Carolina Fierro (213714)
Piet Demeester (404606)
Jan Fostier (404603)
Kathleen Marchal (1406)
Publication venue
Publication date: 01/01/1970
Field of study

(a) Conversion of binary data to a network representation. All continuous data are mapped to a binary representation with ‘1’ (colored squares) corresponding to a gene with a value deviating from normal for a particular sample. Each ‘1’ in the binary datasets is converted to an undirected link (solid line) between a gene node and a sample node. Prior knowledge, derived from public gene interaction repositories, is available in the form of undirected links (dashed grey line) between genes. Characters a-g correspond to gene IDs, S1-S3 represent sample IDs. (b) Construction of the global network. The network representations of the binary datasets and the prior knowledge network are merged to constitute a single comprehensive network representation. Gene nodes originating from the input datasets are connected to the corresponding gene in the prior knowledge interaction network (dashed yellow lines). (c) The resulting adjacency matrix representation of the undirected global network. For clarity, individual gene and sample identifiers are omitted. NET (grey) = genes from the prior knowledge interaction network, S (dark blue) = samples, EXP (green) = genes from the gene expression dataset, CNV (pink) = genes from the copy number dataset, MUT (light blue) = mutated genes, MET (orange) = methylated genes. (d) The similarity matrix derived from the adjacency matrix, indicating the parts of the similarity matrix that are relevant for the pathway ranking task.</p

FigShare

The 20 highest ranking pathways for the two most extreme ovarian cancer survival-based subtypes.

Author: Ana Carolina Fierro (213714)
Jan Fostier (404603)
Jimmy Van den Eynden (776140)
Kathleen Marchal (1406)
Lieven P. C. Verbeke (776139)
Piet Demeester (404606)
Publication venue
Publication date
Field of study

The contribution of each component to the total score is indicated in a different color bar: mRNA expression (dark blue), copy number (light blue), mutation (green) and methylation (yellow).</p

FigShare

The 20 highest ranking pathways for each of the four breast cancer subtypes.

Author: Ana Carolina Fierro (213714)
Jan Fostier (404603)
Jimmy Van den Eynden (776140)
Kathleen Marchal (1406)
Lieven P. C. Verbeke (776139)
Piet Demeester (404606)
Publication venue
Publication date
Field of study

The aggregate score assigned to each pathway can be decomposed into 4 probabilistic components. The contribution of each component to the total score is indicated in a different color bar: mRNA expression (dark blue), copy number (light blue), mutation (green) and methylation (yellow).</p

FigShare

Pathway relevance scoring.

Author: Ana Carolina Fierro (213714)
Jan Fostier (404603)
Jimmy Van den Eynden (776140)
Kathleen Marchal (1406)
Lieven P. C. Verbeke (776139)
Piet Demeester (404606)
Publication venue
Publication date
Field of study

Given a subset of the global similarity matrix (Sexp Scnv, Smut, Smet, see <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0133503#pone.0133503.g001" target="_blank">Fig 1</a>) and a set of genes (a,b,d) constituting a pathway P, a score for each input dataset is calculated by first removing genes from Sexp Scnv, Smut, Smet that do not belong to the pathway and then taking the average of all remaining values in Sexp Scnv, Smut, Smet. This process is repeated for n randomly generated gene sets (with the same number of genes as the pathway P) yielding n scores for each input dataset. The random pathway scores are used to calculate a p-value for obtaining the pathway scores purely by chance. The resulting p-values are multiplied, resulting in a single aggregated pathway score.</p

FigShare

Pathway scores compared across breast cancer subtypes for a selection of pathways.

Author: Ana Carolina Fierro (213714)
Jan Fostier (404603)
Jimmy Van den Eynden (776140)
Kathleen Marchal (1406)
Lieven P. C. Verbeke (776139)
Piet Demeester (404606)
Publication venue
Publication date
Field of study

Dark blue = Basal-like, light blue = HER2, green = Luminal A and yellow = Luminal B.</p

FigShare